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ABSTRACT 



In this technical report, a general formula for the ft-value 
of a 2x2 matrix-differential game is derived, using two 
different methods. 



Prepared by: 



1. Introduction, 



Consider a two-person zero-sum game with matrix [a..] which is 



being played for one unit of time. If in a time interval At player 1 



plays row i and player 2 column j the payoff to player 1 is a.. At, 



Payoffs are accumulated with time so that the total payoff is 



1 

* 

a . . d t . 
ij 

0 



Of course, if instantaneous changes of rows or columns were allowed the 

integral would not be defined, John Danskin [1] has proposed to resolve 

this dilemma by approximating the game by n-stage games with perfect 

information. In the n-stage approximation, player 2 is allowed to change 

the column index only at times t = — ; k = 0,l,...,n-l, while player 1 

n 

k o 

is allowed to change the row index only when t = * , where 0 £ a £ 1 

n n 

is a fixed parameter. The payoff is then 



1 

0 



a . . dt = 
iJ 



1 

n 



l 



k=l 



a a. + 

1 k-l J k 



(l-a)a 



Vk 



The row index i^ is called the predecessor. We denote this game 

and call it the n-stage ft- game with predecessor i^. Its value 



ft (a;i A ) = min max 
n U . 



i n 

min max — ) o a. , + (l-a)a. . 

j i - k-1 Vi J k 

n n 



Vk 



d.i) 



John Danskin has proved (in a far more general context) that the sequence 



fi 1 (a;i Q ) ,fl 2 (cr;i 0 ) , . . . 

converges to a limit which does not depend on the predecessor i^ . 



( 1 . 2 ) 

This 



limit 



2 



ft(a) = lim ft (a;i.) 

n u 

n-H» 

is then called the fi-value of a matrix-differential game [a_]. 

The main purpose of this technical report is to establish the 
following formula for the ft-value of a two-by-two game. 



Let A = 



a ll a 21 



ua 12 a 22- 



be a matrix, let I = {1,2} be the index set, 



0 £ a £ 1. Denote M = max max {a..}, M = min min {a..}, 



i€I j€I 



i€I j€I 



V = min max {a..}, V = max min {a..}, and 



j€I 161 






i€I j€I 



ij 



v(a) = y[a( M+V) 4- (1-a) (M+V) ] . 



Then 



V if v(a) £ V, 

ft(a) = | v(a) if V_ < v(a) < V, 

V if v(a) ^ V. 



We provide two independent proofs of this formula, each using a 
different approach — Shubert’s (Sec. 2) or Washburn T s (Sec. 3). These 
sections can be read independently. The notation is common. A brief 
comparison and discussion of the two approaches is contained in Sec. 4 
together with some ideas about possible generalization to larger games 



let 



(1.3) 



two 



3 



2 . Shubert's Approach . 

From (1.1) it is easily seen that satisfy the recurrence 

relation 

ft ,,(cr;k.) = min max{— j-r [aa . + (l-a)a..] + — 77 - ft (c;i)}, k £ I, ( 2 . 1 ) 
n +1 i€I n +1 kj ij n +1 n 

with initial condition = 0 . This relation could be used to evaluate 

Q(a), which can be done on a computer provided numerical values for a 

and the entries a.. are given. 

ij 

However, if we wish to obtain the entire function ft( • ) and establish 
some of its properties, the recurrence relation ( 2 . 1 ) may be of little help. 
An analogy with similar problems suggests that perhaps £}(•) can also be 
found as a solution of a functional equation obtained from ( 2 . 2 ) by letting 
n Unfortunately, such a process would yield the equation 

ft(* ;k) = min max ft( # ;i), 

j i 

which is satisfied by any function such that ft(a;k) = for k f l. 

In what follows we are going to show that the difficulty can never- 
theless be circumvented by solving the equation 

Q(a;k) = min max {X [era, .+(l-a)a . . ] + (l-A)ft (a ; i) } , k € I, (2.2) 
j€I i€I J 1J 

where 0 < X £ 1 is a parameter. 

We first prove that this equation has a unique solution, depending, 
of course, on the parameter Next we show that as \ -*• 0 this solution 

converges to the ft-value. As a byproduct we obtain a new proof for the 
convergence of the sequence (1.2). Finally, by actually finding the solu- 
tion of (2.2) for small X > 0 and letting X -* 0 we establish the formula 



(1.3) for the ft-value. 
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We begin with some definitions. To simplify notation we temporarily 
suppress a and abbreviate 



h. . = oa, . + (l-o)a. . , 
ij kj ij 



where i 6 I, j 6 I, k € I. 

r\ 

Let x = ^ R e be a two com P onent vector, let 0 ^ A ^ 1. 

Denoting 

H k .(X,x) = Ah k . + (l-A)x . 
ij - iJ i 

i € 1, k € I, j € I, 

we define a parametric family of mappings 



by 



Let X-l >i 2 * ’ ' 



T(A,x) = (T 1 (X;x) ,T 2 (X,x)) 



T (X ,2i) = min max H . . (X ,>0 ; k £ I. 

k j€I i€I 13 

be a sequence of vectors from Re 2 defined by 



(2.3) 



6 Re 2 , x , a+1 ~ T ^ , ^n^ n = 



(2.4) 



Comparing with (2.1) we see that with = — 



| (fi^(ojl) ,fi^(o^2)) , 



so that 21-^ 2 5 * * • indeed the sequence defining the ft-value of the 

ft-game with matrix A. 

Next, let ||*|| be the maximum norm on Re 2 



I lil I = max{ | Xl | , |x 2 | }, 
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let a be a constant such that 



Let 



max max 

i€I j€I 




£ a . 



X = {x € Re 2 : | |x| | £ a} 
We are now ready to state the following theorem 



Theorem 1 : Let 0 X ^ 1, let T be the mapping defined by (2.3) and 

restricted to the domain X, let 2 £-l> 2£2 , * , ‘ be t ^ Le sec l uence (2.4). Then 
the following is true. 

(1) For every 0 < X £ 1. there is a unique fixed point 

x(A) = T(A ,2<(A)) . 



(2) The limit 



lim xXX) = i = (£, ,£,) 
A-K) 



exists and = ^2‘ 



(3) The sequence — q *—2 9 * * * converges and 



lim x = lim x(X) . 
ir*» ~" n x^O 
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Proof : The proof is divided into a sequence of eight lemmas. Lemma 3 

proves statement (1) , Lemma 7 statement (2) and Lemma 8 statement (3) . 
We would like to point out that the proof may seem rather uneconomical 
for the two-by-two case (for instance, Lemma 3 can be proved without 
using Banach's theorem). 

We, nevertheless, leave it in its present form with an eye on 
possible generalization. 

Lemma 1 : For any 0 ^ X £ 1 

x € X =* T(A,x) € X 

Proof : 



|t(X ,x) 1 


| = max{ 


|T 1 (X,x) 


|,|t 2 (x,x)|} 


^ max 


max max 


l xh ii + 


(l-X)x | 


k6I 


i€I j€I 




l 1 


^ A a + 


(1-X) | |x 


.11 ^ a 


if x € X. 



Lemma 2 : For any 0 £ X £ 1 

| | T (A ,x) -T(X,x')|| * (1-X) | |x-x' | | 

Proof : If and (g^,g 2 ) are any two real vectors then 

| max{ f x > f - max{g 1 ,g 2 } s£ max{ | f 1 ~z 1 | , | f 2 ~g 2 1 } , 

and 

|min{f ,f 2 > - min{g 1 ,g 2 }| <: max{ | f 1 “g 1 | , | f 2 ~g 2 j } • 
Using these two inequalities in the reversed order we obtain 
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| | T ( A »2S.) - T(X,x/)|| = max | min max H^.(X, 2 c) - min max H^.(X, 2 c')| 

k€I j€I i6I 1J j€I i€I 1J 



k k 

£ max max | max H..(X,x) - max H . . (X ) | 
k€I j€I i€I 1J i6I 1J 



i lc 1c I 

^ max max max | H . . (X ,x) ” H . . (X >x ' ) | 

k€I j€I i€I 1J 



= max | (1-X) (x. -x! ) | = (1-X) | |x - x' | | . 
i€I 1 1 



Lemma 3 : For every 0 < X 1 there is a unique x £ X such that 

T (X y x) = x. 



Proof : By Lemma 2 the map T is for any fixed 0 < X £ 1 a contraction 
and by Lemma 1 it maps a closed bounded set X into itself. Hence by 
Banach’s fixed point theorem it must have a unique fixed point. 



Lemma 4 : 
either 



or 



or 



or 



For every 0 < X £ 1 



x^X) 




x^X) 




x x (X) 

x 2 (X) 

x x (X) 



there is p € 1 and q 6 I 



? 2 (X) - h‘ p +X(h2 q -hl p ), 



? 2 (X) ■= h 2 2q + X(h> p -h2 q ), 



h* + h 2 - Xh 2 

2p iq iq 

2 - X 



h* + h 2 - Xh* 
_2£ Iq 2p 

2 - X 



h lp * *2 (X) = h 2q* 



such that 

(2.5) 

( 2 . 6 ) 

(2.7) 



( 2 . 8 ) 
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Proof : Since x = (x^ y x^) is a fixed point of the map T(X,*) we must 

have 

x = h* + (l-X)x. , 

1 ip l* 

*2 »J q + W-» S i- 

where i 6 I, j € I, p 6 I, q £ I, Choosing values 1 or 2 for the 
index i and the index j we obtain four sets of pairs of linear equations 

= Xh ip + 

*2 = Xh i q + 

= Xh^ + (1-X)x 2 
*2 = Xh 2q + (1_X)Sr 2 

*1 = Xh 2p + (1_X)5 2 

x 2 = Xh^ + (1— X)x^ 

x = Xh} + ( 1— X ) xl 
1 lp 1 

x 2 = Xh^ + (1-X)x 2 

The solution of each set gives the corresponding expression (2.5)— (2.8) . 

Lemma 5 : lim | ST ( X ) - x^(X)| = 0. 

X+0 1 2 

Proof: Assume the statement is false, i.e. there is > 0 such that 

for every X^ > 0 there is 0 < X < X^ for which 

Ix^X) - x 2 (X) | £ e Q . 
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By Lemma 4 this could only be if x-(A) is given by (2.8) i.e. 



X 2^ h lp ’ X 2^ h 2q 



and 



(2.9) 



l h lp h 2ql * £ 0‘ 



( 2 . 10 ) 



Next since x:(A) is a fixed point 



x (A) = min max H . . (A ,x) , k ^ I, 

k j€I i€I 1J 



and because of (2.9) this implies 



and 



H* p (A >£> ^ H 2 p >E) > 



H ^q(A ,x) ;> (A ,x) . 



Substituting from (2.9) we obtain 



and 



Xhlp + (l-x)h| p * xh> p + (1-X)h2 q> 



Ah2 + (1-A)h2 ;> Ah? + (l-A)hJ , 

2q 2q lq lp 



which is equivalent to 



h lp h 2q ^ X(h 2p h 2q ) ’ 



h. 1 - h? £ A (h. 1 -h? ) . 

lp 2q lp 2q 



However, these two inequalities cannot hold for arbitrarily small A > 0 
unless h* = h| which contradicts (2.10). The lemma is proved. 
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Lemma_6 : Let = (f^ f^) and ^ = ( g;L , . . . , g ) , j = l,...,n 

be given vectors. Let for every 0 < A £ A^ at least one of the n 

inequalities A_f^ ^ _g^ be satisfied. 

Then there is a Aq € (0,A^] such that if the inequality 

Af. ^ g. is satisfied for A = A~ it is satisfied for any 0 < A £ A^. 

^0 % 0 0 

Proof: Since all these inequalities are linear there is, for every j, 

a closed interval J d [0,A^] such that the j-th inequality is satisfied 

if and only if A £ J_,. Although some of those intervals may be vacuous 

the system must cover (0,A^] since by assumption at least 

one of the inequalities is satisfied for any A ( (0,A^] . Hence, there 

must be at least one interval with nonempty interior and left endpoint 

zero, say J. = [0,A n ], A > 0, and the statement follows. 

J 0 u u 

Lemma 7 : There exists Aq > 0, a vector g_ = and a vector 

su °h that either 

x\A) = JL + Ab for all 0 < A ^ Aq, (2.11) 

or 

x(A) = ^ 2 + _ A ~ for all 0 < A £ A Q . (2.12) 

In either case the vector is such that 

Ci = c 2 - 

Proof: By Lemma 5 the fixed point 2 i(A) is given by one of the expressions 

(2.5)— (2.8) . Since p and q generally depend on A this leaves 16 



possibilities . 



11 



Since x(\) is a fixed point there is p f I, q £ I, r 6 I , 
s 6 I, in general depending on X, such that 



X- = H 1 (X ,x) = min max H* . (X ,x) , 

1 pq " 161 161 - 

and 

x = H 2 (X,x) = min max H 2 .(X,x). 

2 rs j€I i€I 1J 

But A = min [max (A ,B) , max(C,D)] is equivalent to 



A ^ B and A £ C 



or 



A ^ B and A £ D . 



(2.13) 



Hence, (2.13) is satisfied (with some specific p, q, r, s) if and only 
if at least one of the resulting four systems of four inequalities between 
pairs of the H f s is satisfied. Now, each of these inequalities is of 
the form 

Xh 4- (l-X)x ^ Xh' + (l-X)x' , (2.14) 

k. » 

where h and h each stands for one of the h.. s and x and x 

iJ 

each for one of the two components of x. 

By Lemma 6 there must be X^ > 0 such that for 0 < X £ X^ x(X) 
cannot be given by (2.8) unless h| = h 2 ^. 

Hence, substituting for x* and x* 1 any one of the expressions 
(2.5)-(2.7) or the expression (2.8) with h|^ = h 2 ^, th e inequality (2.14) 
becomes either 

X (h-h 1 ) + (l-X)XB £ 0 
or 



(2-X)X (h-h 1 ) + (l-X)XB ^ 0 
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where B is a constant involving the h 's. But being positive, 

can be canceled and the inequalities become linear in X . 

Thus we have a finite number of systems of four linear inequalities 
and since the fixed point exists by Lemma 3 for every 0 < X £ 1 at 

least one of the systems must be satisfied for every 0 < X £ X^. Hence, 
by Lemma 6 there is a X^ > 0 such that if a particular system is satis- 
fied for X = Xq it remains satisfied for all 0 < X ^ X^. Consequently, 
the indexes p, q, r, s, which identify the fixed point for X = X^ 
will remain unchanged as X decreases to zero. Hence _x(^) will for 
every 0 < X ^ X^ be given by one of the expressions (2. 5) -(2. 7) with p 
and q no longer changing with X and thus by Lemma 5 must have one of 
two forms (2.11) or (2.12) with . The lemma is proved. 



Lemma 8 : Let x^ 9 x ^ 9 . . . be a sequence satisfying (2.4), let 

£ = lim x(X) . Then the sequence x ,x , . . . converges and lim x = £. 
X+0 1 z n-**> ~ n 

Proof : Let e > 0, let N be a positive integer such that 



1 

N 






(2.15) 



and 



5Pk •= t £or every n ‘ 

k=0 

— 2 

where K = max{ | |bj | , | |jj_-2bj j (2-X^) } and X^,]^ are those of Lemma 

Such an N exists for any e > 0 since the harmonic series is Cesaro 
summable to zero. 

Let n be a nonnegative integer. By Lemma 2 



(2.16) 

7. 



-Wnd I ll T ( N+n ’%+n-l^ “ T( -N-l-n 




£ 



N+n-1 

N+n 



^-%+n-l 




(2.17) 
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and by triangular inequality 



” - l 'M+n ) I I £ I I^N+n-1 — ^N+n-1^ I I + 



l£ ( sfc> 






By Lemma 7 if 0 < X £ Xq and 0 < X 1 £ Xq 



x(X) - x(X ') II <; |X - X ' 



or 



|x(X) - x(X') | | ^ |X - X' | |^x~)^ 



depending on which of the two expressions (2.11) and (2.12) applies. 
Hence 



-s ( sr^r ) 



K 



(N+n) (N+n-1) 



(2.19) 



Putting (2.17), (2.18) and (2.19) together and calling 



= A we have 
n 



. N+n-1 A 

A ^ —rr, A 



K 



n N+n n-1 (N+n) 4 



l%bn — ^N+n^ 



Iterating we obtain eventually 



K 



A n ^ N+n A 0 + N+n . L n k 

k=0 



n i 

y - 

L \r 



for every n = 0,1,... and hence by (2.15) and (2.16) and the obvious 

fact that 0 £ I I A II £ a, the radius of X, we have 

n 



A < e for every n = 0,1,... . 



n 



Thus lim A < e for any e > 0 so that lim | |x - x(~) 11=0 and 
n 1 '—n — n 1 1 



n-*» 



n -*» 



since by Lemma 7 lim x(— ) = £ the proof of Lemma 8 and also of Theorem 

— n 

n-*» 



1 is terminated. 
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Now we can proceed with the proof of the formula for the 
Theorem 2 : The formula (1.3) is true. 

Proof : Since always V £ V we distinguish two cases. 

Case 1 : V < V 

With no loss of generality assume that the matrix A is 

a ll ^ a 22 ^ max ^ a 12 > a 2l^’ 

relabelling the entries if necessary. Then we have 

M = a V = a 22 and V = max{a 12 ,a 22 } . 

We will again use the notation 

tu. = aa. . + (l-o)a. . and H^,(X,x) = Xh^\ + (l-X)x. 
ij kj ij ij ij k 

Notice that in this notation 

v (o) = ■j( h 22 +h ip 

k 

and that if k = i, h. . = a. . . 

ij iJ 

Case 1.1 : v(o) ^ a 22 

Let 

X 1 = Xh 22 + (1_X)h 22 ] 




ft-value. 



such that 



( 2 . 20 ) 
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so that 

H 22^»— ■ ^ = x i and H 22^’— = x 2 (2.21) 

Next 

H^ 2 (X,x) - H| 2 (X,x) = X(hl 2 - h } 2 ) + (l-X)(x 2 -x 1 ) 

= X[hJ 2 - hj 2 + ( 1— X ) (h 22 “h 22 ) ] , 

which is nonnegative in a positive neighborhood of zero if and only if 
h 22 ” h 12 ^ °* But h 22 = a 22 ’ h l2 = a l2 S ° that indeed 
H^CX.x) - H^ 2 (X,2i) ^ 0 for small X > 0. 



Next 



H^U.x) ' H^CX.x) = X [h 22 - hj 2 + (1-X) (h| 2 -hl 2 ) ] £ 0 

if and only if h| 2 £ h|^ , which is true since h| 2 = a 22 and hj^ = 3 -q ■ 
Thus for small X > 0 



H 22^’— ^ = min max H £-(^>2i) 
j i 1J 



( 2 . 22 ) 



Similarly 



and 



H 22 - H 12 * 0 “ 2h 22 - (h 12 +h 22 ) 2 0 » 2a 22 * a 22 + a 12' 



H 22 " H ii * 0 ° 2h 22 " 2v(a) * °* 



which is true since h^ = a^. Hence, for small X > 0 



H^.(X,2c) = min max H^.(X,x), 
j i J 
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which together with (2.21) and (2.22) means that for all X > 0, in some 
neighborhood of zero (2.20) is a fixed point of the mapping (2.3). Conse- 
quently by Theorem 1 the ft-value 



Case 1.2 : 

Let p € I 



Q ( o) - lim x = hi - a . 

X-+0 

v(o) 2: V 

and q € I be such that V = a 

- pq 



so that 



v(a) 



■ 

2 qq 



+ h q ]. 

PP 



Notice that p ^ q 



Let 



so that 




x 

q 




+ (l-A)h P 

pq 



x = H P (A,x) and x = H q (A,x). 

p pq - q pp 



Similarly as in Case 1.1. 






pq 



- H p 2 0 « 2h p - 2v(o) ^ 0, 



qq 



which is true since h P = a 

pq pq 



pq 

= V. Next 



H F - H f = A(h F -h F ) = A(a -a ) £ 0, 

pq pp pq pp pq pp 



so that 




= min max H 
3 i 



P 

ij 



(2.23) 



(2.24) 



(2.25) 



Similarly 
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since 



and 



H q - H q k 0 « h P - h q ^0 



pp qp 



pq qp 



h p = a = V k M = a =h q , 

pq pq - — qp qp 



H q - H q £0 o h P - h q £ 0 



pp qq 



pq qq 



since 



a £ a Hence 

pq qq 



= min max . 



PP 



J i 



ij 



and (2 . 23)— (2 .26) imply ft(a) = = V. 

pq ~ 



Case 1,3. 



V < v(a) < V 



Calling again V = a we now take 

- pq 



x = -r^r- (2 v(a) - Xh q ) , 
P 2-X pp 



x = (2v(ct) - Xh P ) . 

q 2-X qq 



We have now 



Next 



H* 



qq 



x = H P (X,x) , x = H q (X ,x) . 

p qq - q pp - 



H P = X[h P - h P + (h q -h P )] 

pq qq pq 2-x pp qq 



so that for small X > 0 



H P - H P ss 0 if and only if 2(h P -h P ) + h q - h P :> 0 

qq pq qq pq pp qq 



2v(o) - 2h F ^ 0, 

pq 



(2.26) 



(2.27) 



(2.28) 



But this is equivalent to 
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which is true since h P = a = V. Similarly 

pq pq ~ 



H P - H P £ 0 

qq pp 



2 (h P -h P ) + h q - h P £ 0 « 2v(a) - 2h P £ 0 

qq pp pp qq pp 



since 



Thus 



, P = 
PP 



a ^ V and v(a) < V. 
PP 



H P = min max H? 



qq 



J 1 



iJ 



(2.29) 



Finally 
jq 



H 



PP 



H q ;> 0 « 2(h q -h q ) + h P - h q ;> 0 « 2v(a) 

qp pp qp qq pp 



2a £0 

qp 



since a = M, and 

qp 



H q - H q £ 0 o 2 (h q -h q ) + h P 

pp qq pp qq qq 



h H ^ 0 « 2 v(o) 

PP 



- 2h H £0 

qq 



since v(a) < V ^ a . Hence 

qq 



I q = 
PP 



min max H . . , 

j i 1J 



(2.30) 



and (2. 27) -(2. 30) yield 



ft(a) = v(a) . 

Case 2 : V = V 

Although we could use the same procedure as in Case 1 again we prefer 

a simpler argument. If V = V = V then the matrix A has a saddle, say 

a = V. Since every n-stage ft-game G (a;k), being a game with perfect 
k-X/ n 

information, has a pure value we must have 



Q (a;k) = V for every n = 1,2,... 
n 
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But by Theorem 1 11m ft (o;x) = ft(cr) regardless of l £ I. Hence 

n 

n-**> 

ft (a) = V for all 0 ^ o ^ 1 and the theorem is proved. 
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3 . Washburn T s Approach . 

We will proceed to prove formula (1.3) by showing that Player 1 
can get at least ^ (a) + 0(l/n) , that Player II can prevent player I 
from getting more than Q^Ccr) + 0(l/n) , and that ft*(a) = = 

ft(a) • The two functions and (a) will be defined within the 

theorem. For future reference, let the clockwise and counterclockwise 
averages be 

v^Ca) = .5((a i:L +a 22 ) (1-a) + (a 12 +a 21 >a) , 

and 

V C( >) = •5((a 11 +a 22 )a + ( a 12 +a 2 i ) (1-cr) ) . 



Theorem: Formula (1.3) is true. 



Proof: We will show that players I and II can asymptotically confine them- 

selves to four strategies each, those strategies being the four functions 
mapping {1,2} onto itself. For example, the strategy (2,1) for I 
means that I will always choose 2 if II has previously chosen 1, or 
1 if II has previously chosen 2. We will use the symbol s ■£ x to mean 
that the strategy s for I assures that the average payoff to I will be 
at least x + 0(l/n), where n is the number of moves, regardless of 
the starting state. Similarly for II. 

We proceed by showing that 
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(1,2) + min{v c (a) ,a l;L ,a 22 } 

(2.1) + min{v cc (o) ,a 12 ,a 21 } 

(1.1) + min{a i;L ,a } 

(2.2) min{a 21 ,a 22 } 

(1.2) +. max{v cc (a) ,a u ,a 22 } 

(2.1) max{v c (a) ,a 12 ,a 21 > 

(1.1) j*j. max{a 11 ,a 21 } 

(2.2) max{a 12 ,a 22 > . 

for I. After n stages, let 
(i,j) initiated by I(II) . 
of the starting point, 

- 1 

- 1 

1 



For example, consider the strategy (1,2) 
n'!' . (n^) be the number of visits to state 



(1,2), 


regardless 


I 


ii 




II 


n II 


£ n 21 


+ 


n ll 


I 


ii 




II 


n 22 


* n 12 


+ 


n 22 




II 


II 




|n 21 n 12 


1 * 



"21 ' ° 



n 12 ’ 0 



n n 11 = „ . 
i i 



Since the average payoff to I is 



( 1 , 2 ) 



= — l l (n* (1-a) + a) 
n V i ij ii 



i J 



ij 



l ir 
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a lower bound on ^ can be obtained by solving a linear programming 

problem. Furthermore, if we define 



1 - 1 / , II II/ 

p.. = n../n and p.. = n. ./n, 
ij iJ ij 



and neglect terms that are 0(l/n), the program is 



min £ Tlp^-Cl-a) + p** a]a 
L L ij ij ij 



= v 



( 1 , 2 ) 



subject to 



I II L II 

P 11 = P 21 + P 11 



I _ II , II 

P 22 P 12 + P 22 



II _ II 
P 21 P 12 



I - 1 - n 

P 21 P 12 ° 



1 1 p u ‘ 1 • 



There are six equations in eight variables. There is an optimal solution 
in which at least two of the variables, in addition to and p^> 

are 0. If these two variables are p^J and p^* th en P^ = P^ = 

p 12 ’ p 22 = ' 5 and V (1 ,2) is v o (a) ' If hi ' °\ v (l,2) * a 22' 

and if p^ = 0, ^ = a^. ^ P 21 = V (1 2) ^" S some ra i xture °f 

a^^ and a 22' cases > (1»2) min{v^ (a) ,a^ >^22^ * Essentially, 

II has his choice of either going around in a clockwise circle or else 
acepting one of the two payoffs that I keeps trying to obtain. The rest 
of the -* statements can be obtained similarly. 

Taking account of all four of I’s (II 1 s) strategies, we see that 
I (II) can guarantee that the payoff will be at least (at most) 
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fZ(cr) + 0(l/n) and + 0(l/n), where 

n I (a) = max{min(v cc (a),a 12 ,a 21 ),min(v c (a),a ,a ),V} 

n i:L (a) = min{max(v (a) ,a ,a ) ,max{v (a) ,a , a. } ,V} 

cc 11 ZZ c 1Z 21 

The fact that ft^(a) = ft^(a) = ft (a) can be shown by exhaustion of cases. 

Since ft^(a) ^ V and ft^(o) £ V, the proof is trivial if V_ = V. If 

there is no saddle point, suppose a^ or is the largest element 

M. It follows that the term involving V CC ( Q ) may deleted from both 

ft^(a) and ft^(a), that the second largest element V is on the same 

diagonal, and also that v(o) = v (a). The result is that 

c 

ft^(a) = max{min{v(a) ,V} ,V} 
ft^^(a) = min{max{v(a) ,V} ,V} 

Both of these expressions are equivalent to 1.3. A similar proof holds 
if a^ or a 2 i t ^ ie l ar g est element. 

QED 
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4 . Comparison of Techniques and Prognosis for Larger Games . 

The 2x2 game would be much easier to solve if it could be 

discovered that the function ft (a) satisfies some simple functional 

equation that has a unique solution. This does not seem to be the case. 

Shubert deals with this problem by inventing a situation that is almost 

as good. He introduces a function ft (a ,X ,k) (x^(X) in his notation) 

which does satisfy such a functional equation, and then shows that 

lim ft(a,X,k) = ft(a) regardless of the index k. He next shows there 
X+0 

are only a finite number of candidates for the solution of the equation, 
one of which must apply for all X smaller than some positive Xq. 

Given the matrix A, he finds the solution, and consequently ft(a) . 

This procedure is probably generalizable to larger games. The difficulty 
will be that the number of candidate solutions will grow very fast with 
the size of the matrix. It is possible, however, that a systematic 
procedure can be found to determine the correct solution without an 
exhaustive search. 

Washburn uses a time-honored technique for solving two-person 
zero-sum games; he guesses the optimal strategies and shows that they are 
in equilibrium. The generalization of his procedure when I has m 
strategies and II has n strategies would be 

1) Let I (II) confine himself to all functions mapping (l,...,n(m)} 
onto {1 , . . . ,m(n) } . There are m n (n m ) such strategies. 

2) For each such strategy, find the asmyptotic bound that it guarantees 
for the player who is using it by writing a Linear Program with 2MN 
variables . 



3) Show that the greatest lower bound equals the least upper bound 
(a theorem is required) . 
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Given the theorem in 3) , one could find the Q-value of an m x n game 
for one particular value of a by solving min{m n ,n m } Linear Programs 
with 2mn variables each. 

It is apparent that either method for finding the ft-value of a 
large game would require a vast amount of computational effort, even if 
all the required theorems can be proved. The central problem for matrix- 
differential games would therefore seem to be the invention of a practical 
procedure for finding solutions; it would be particularly valuable if 
the procedure were able to find solutions for all a. By "practical," 
we mean a procedure where computational effort increases less than 
exponentially with m and n. 
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